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[57] ABSTRACT 

A system extracts, from uncoded video data, coding param- 
eters that correspond substantially to coding parameters used 
in a coding operation that was previously performed on the 
video data. Tlie system generates a histogram for at least one 
AC frequency band in the uncoded video data and calculates 
a correlation function for the histogram. A maximum value 
of the correlation function is then identified, and a coding 
parameter for the at least one AC frequency band is deter- 
mined using the maximum value of the correlation function. 

13 Claims, 6 Drawing Sheets 



C START ) 

_I_ 



GENERATE HISTOGRAM 



I 



S601 



CALCULATE CORRELATION FUNCTION 
FOR HISTOGRAM 



I 



8602 



IDENTIFY MAXIMUM VALUE 
OF CORRELATION FUNCTION 



I 



S603 



DETERMINE CODING PARAMETERS 



(jnT) 



-S604 



05/27/2004, EAST Version: 1.4.1 



U.S. Patent 



Aug. 8, 2000 



Sheet 1 of 6 



6,101,278 



SOURCE 



A. 



ENCODER 



DECODER 



ENCODER 


► 


DECODER 






► 


5 


FIG.1 


7 





INPUT 



REDUNDANT 
FED/FRAME 
DEIECTION 



X 



27 

A 



ME & ME MODE 
SELECTION 



I 



FRAME STORE 



BIT 



STREAM 
\ 
31 



BUFFER 



I.P 



MACROBLOCK 

COLOR 
CLASSIRCATION 



I.P 
— ► 



-11 



OCT 



1 



14 



ADAPTIVE 
RELD/FRAME 
DECISION 



26 



DCT 
^ 



12 



I.P 
— ► 



16 



FIRST-PASS 
CODING 



MATRIX 
CALCULATION/ 
EXTRACTION 



-29 



SECOND-PASS 
CODING 



'20 



ADAPTIVE 
QUANTIZATION 
CONTROL 



17 



19 



-30 



24 



HISTORY 



22- 



21 



RATE 
ALLOCATION 



FIG. 2 



05/27/2004, EAST Version: 1.4.1 



U.S. Patent 



Aug. 8,2000 



Sheet 2 of 6 



6,101,278 



( start) 



INPUT VIDEO DATA 



•8301 



S302 



VIDEO DATA \ NO 
PREVIOUSLY CODED? 






YES 






f 




EXTRACT CODING PARAMETERS 


_^S305 


FROM VIDEO DATA 






/S306 



CODE VIDEO DATA USING 
EXTRACTED PARAMETERS 



S303 

A. 



CODE VIDEO DATA USING 
DEFAULT/CALCULATED MATRIX 



TRANSMIT CODED 
VIDEO DATA 




r 



-S304 



FIG. 3 



05/27/2004, EAST Version: 1.4.1 



U.S. Patent Aug. 8, 2000 sheet 3 of 6 6,101,278 




U.S. Patent Aug. 8, 2000 sheet 4 of 6 6,101,278 



( START ) 



GENERATE HISTOGRAM 



•S601 



CALCULATE CORRELATION FUNCTION 
FOR HISTOGRAM 



IDENTIFY MAXIMUM VALUE 
OF CORRELATION FUNCTION 



DETERMINE CODING PARAMETERS 



S602 



S603 



-S604 



1 

( END ) 

FIG. 6 




05/27/2004, EAST Version: 1.4.1 



U.S. Patent 



Aug. 8, 2000 



Sheet 5 of 6 



6,101,278 



cc 

s 

CO 



1 1 1 ^-1 1 

• 


* 


















— — 1 




1 






0 


1 

5 


10 


15 


20 


25 


30 



FRAME NUMBER 

FIG. 8A 



CALCULAra MATRICES 

DEFAULT MATRICES 

EXTRACTED MATRICES 




10 15 20 
FRAME NUMBER 
U 



30 



FIG. 8B 



CALCULATED MATRICES 
DEFAULT MATfllCES 
EXTRACTED MATRICES 




10 15 20 
FRAME NUMBER 

fig' 8C 



CALCULATEDMATRICES 

DEFAULT MATRICES 

EXTRACe MATRICES 



05/27/2004, EAST Version: 1.4.1 



U.S. Patent 



Aug. 8, 2000 



Sheet 6 of 6 



6,101,278 




FRAME NUMBER 

FIG^QA 



CALCUWTED MATRICES 

DEFAULT MATRICES 

EXTRACTED MATRICES 



1.5i 




1.5- 



10 15 
FRAME NUMBER 



20 



25 



30 



CALCUWTED MATRICES 
DEFAULT MATRICES 
EXTRACTED MATRICES 




05/27/2004, EAST Version: 1.4.1 



6,101,278 

1 2 

SYSTEM FOR EXTRACTING CODING coding, coding parameters from a previous coding operation 

PARAMETERS FROM VIDEO DATA are evident based on the uncoded video daU. Specifically, 

the invention performs a series of processes on AC fre- 

The invention described herein was developed pursuant quency bands in the video data in order to extract these 

to United States Government Contract No. s coding parameters from the imcoded video data. 

70N1NB5H1174, which was awarded by the Nationallnsti- More specifically, according to one aspect, the present 

tute For Standards And Technology (NISI). invention is a method, and a corresponding apparatus, which 

extracts coding parameters from uncoded video data, where 

BACKGROUND OF THE INVENTION the coding parameteis corre^ond substantiaUy to coding 

1. Field of the Invention parameters used in a coding operation that was previously 

~ ^. ^ - j-^j. . L-i- performed on the video data. The method includes gcncrat- 

The present mvention is directed to a system which • a^c. uj-.t. 

^ ^ c J J J J * J- * 1 * mgdi histogram for at least one AC frequency band m the 

extracts, from uncoded video data, codmg parameters relat- j j -T j . j i i a c r 

. -ru • uncoded video data and calculatmg a correlation function for 
mg to a previous coding process. The mvention has partial- *l u- * * • i r.i. i ^- xl .- 
, ^.i.^*^. „ ,^ , J. . . /• . the histogram. A maximum value of the correlation function 
lar uulity m muIti-generational codmg systems, m whidi the ic - -j j u & j- * r *i. * i * 
*j J * • J J J J J J u r IS identified, whereafter a codmg parameter for the at least 
same video data is coded and decoded a plurality of times. Ar> c uj j* -j • *i. 

' one AC frequency band is determmed usmg the maximum 

2. Description of the Related Art value of the correlation fimction. 

Generally speaking, a muhi-generational coding system ^y taking advantage of coding parameters which arc 
mcludes encoder/decoder pairs arranged in scries relative to essentially already within the video data, the present inveo- 
transmission media. In operation, each of the encoders is 20 ^^^^ parameters relating to a 
adapted to code uncoded video data and then to transmit that previous coding process without requiring the use of a mole 
uncoded video data over a transmission medium to a or the like. In this regard, because the invention codes video 
decoder which is adapted to decode the coded video data. data using extracted coding parameters, rather than calcu- 
These systems are in wide use today, particularly where coding parameters, the invention introduces relatively 
frequent transmission of video data from one location to 25 ^^^^ quantization noise into the video data. Moreover, the 
another is required, such as in the television industry. invention is more amenable than its conventional counter- 
Problems arise in multi-generational coding systems of parts to possible alterations of the decoded video, 
the foregoing type due to the way in which encoders in these Furthermore, because the invention can be implemented 
systems code video data. More specifically, in these systems, substantially within software, the need for specially- 
different encoders code the same video data by recalculating ^ designed hardware is reduced. 

coding parameters for each coding operation. Differences in iq preferred embodiments of the invention, the step of 
coding parameters resulting from these calculations cause calculating the correlation function includes identifying a 
quantization noise to be introduced into the video data. predetermined number of peaks in the histogram of the 
Eventually, following several generations of coding, suffi- uncoded video data, and calculating (i) a standard auto- 
cient quantization noise is introduced into the video data to correlation function for the histogram in a case that the 
cause a marked degradation in video image quality. number of peaks is less than or equal to a predetermined 
One conventional solution to the foregoing problem value, where the standard auto-correlation function is per- 
involves introducing additional data, termed a "mole", into formed for all identified peaks, and (ii) a modified auto- 
decoded video data. This additional data defines coding ^ correlation function for the histogram in a case that- the 
parameters to be used for the video data, and is transmitted, number of peaks is greater than the predetermined value, 
along with the video data, between encoders and decoders. where the modified auto-correlation function is performed 
Use of a mole, however, has several drawbacks. First, the for less than all of the identified peaks. By calculating a 
mole must be invisible, otherwise it would affect the quality modified auto-correlation function when the number of 
of images produced from the video data. Making sure that peaks is greater than the predetermined value, the invention 
the mole is invisible can be diflBcult from a technical is able to reduce calculation time without substantially 
standpoint. Second, special hardware may be required to reducing accuracy. 

transmit video data having a mole, otherwise the mole may the invention, the maximum value of the correlation 

be altered during signal transmission. FinaUy, hardware function corresponds to a differentiated dominant recon- 

modifications may be required to ensure that decoders are stmction level ("DDRL") of the at least one AC frequency 

capable of detectmg the mole and of separating the mole b^nd. Thus, in other preferred embodiments of the invention, 

from the video data. For at least these reasons, the mole, and the identifying step comprises determining whether the 

systems like it, have proven an madequate solution to the correlation function includes peaks which overlap and, in a 

above problem associated with multi-generational coding. case that peaks overlap, examining additional values of the 

Thus, there exists a need for a system which reduces the 55 correlation function in order to determine the DDRL. By 

amount of quantization noise introduced into video data virtue of this feamre, the invention is able to reduce errors 

during multiple coding processes, without introducing addi- in the determination of the DDRL which result from over- 

tional information into the video data itself lapping peaks in the histogram. 

SUMMARY OF THE INVENTION Similarly, another embodiment of the invention reduces 

60 errors in the DDRL by comparing first and second peaks of 

The present invention addresses the foregoing need by the correlation function to determine which of the first and 

providing a method of extracting, from uncoded video data, second peaks is larger and which is smaller, and determining 

parameters which correspond substantially to coding param- whether the larger peak is a harmonic of the smaller peak. In 

eteis used in a coding operation that was previously per- a case that the larger peak is a harmonic of the smaller peak, 

formed on the video data. To this end, the invention takes 65 the smaller peak is set as the maximum value; however, in 

advantage of the fact that in some coding techniques, such a case that the larger peak is not a harmonic of the smaller 

as Motion Picture Experts Group (hereinafter "MPEG") peak, the larger peak is set as the maximum value. 
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la still another preferred embodimeDt of the invention, the 
DDRL is determined based on an average of distances 
between dominant reconstruction levels in the histogram. 
Hiis reduces errors in the DDRL which may result from 
differences in distances between different dominant recon- 
struction levels. 

According to another aspect, the present invention is a 
method and corresponding apparatus, for use in a multi- 
generational coding system, of coding uncoded video data. 
The method first determines whether the uncoded video data 
has been subjected to a coding operation in a previous 
generation of the muhi-generational coding system. In a 
case that the method determines that the uncoded video data 
has not been subjected to a coding operation, the method 
codes the uncoded video data using predetermined coding 
parameters. On the other hand, in a case that the method 
determines that the uncoded video data has been subjected 
to a coding operation, the method extracts coding param- 
eters from the uncoded video data and codes the video data 
using the extracted coding parameters. 

By determining whether the uncoded video data has been 
subjected to a coding operation in a previous generation of 
the multi-generational coding system, and then proceeding 
accordingly based on this determination, the present inven- 
tion is able accurately to process both video data which has 
not previously been coded and video data which has previ- 
ously been coded. 

According to still another a^ect, the present invention is 
a multi-generational coding system which inchides first and 
second coding means. The first coding means codes original 
video data using predetermined coding parameters so as to 
produce coded video data, decodes the coded video data, and 
outputs uncoded video data. The second coding means 
receives the uncoded video data from the first coding means, 
determines that the uncoded video data has been subjected 
to a previous coding operation, and extracts coding param- 
eters from the uncoded video data. In this aspect of the 
invention, the coding parameters extracted by the second 
coding means corre^ond substantially to the predetermined 
coding parameters used by the first coding means. 

This brief summary has been provided so that the nature 
of the invention may be understood quickly. A more com- 
plete understanding of the invention can be obtained by 
reference to the following detailed description of the pre- 
ferred embodiments thereof in connection with the attached 
drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a block diagram showing a multi -generational 
coding system in which the present invention may be 
implemented. 

FIG. 2 is a block diagram of an encoder in accordance 
with the present invention. 

FIG. 3 is a flow diagram showing a coding process in 
accordance with the present invention. 

FIG. 4 shows a histogram of DCT coefficients for an AC 
frequency band of original video data >^ich has not been 
subjected to a coding operation. 

FIG. 5 shows a histogram of DCT coefficients for an AC 
frequency band of video data which has previoiisly been 
subjected to a coding operation. 

FIG. 6 shows a flow diagram of a quantization matrix 
extraction process in accordance with the present invention. 

FIG. 7 shows a comparison between values of a quanti- 
zation matrix extracted in accordance with the present 
invention and values of a default quantization matrix. 
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FIGS. 8A to 8C show a first set of simulation results 
comparing operation of matrix extraction in accordance with 
the present invention to conventional matrix calculation. 

FIGS. 9A to 9C show a second set of simulation results 
^ comparing operation of matrix extraction in accordance with 
the present invention to conventional matrix calculation. 

DETAILED DESCRIPTION OF THE 
PREFERRED EMBODIMENTS 

The preferred embodiments of the invention are described 
herein with respect to a multi-generational codings system, 
namely coding system 1 shown in FIG. 1, in which video^ 
data is coded using MPEG2. It should be noted, however,' 
that other coding techniques, s uch as JPEG or the Hke, may 
be used in place o f MPEG2. ivloreover. It is further noted^ 
that the invention [bay be Implemented outside the context 
of a multi-generational coding system. 

As background, MPEG2 coding is performed on an image 
by dividing the image into macro-blocks of 16x16 pixels, 
each having a separate quantizer scale value (hereinafter 
"q„") associated therewith. The macro-blocks are then 
divided into individual blocks of 8x8 pixels. These 8x8 pixel 
blodcs are each subjected to a discrete cosine transform 
(hereinafter "DCT*) which generates DCT coefiEdents for 

^ each of the 64 frequency bands (i.e., pixels) therein. DCT 
coeflBcients in an 8x8 pixel block are then divided by a 
corresponding coding parameter, namely a quantization 
weight. Thereafter, additional calculations are efiEected on 

^ the DCT coefficients in order to take into account q„ among 
other things, and thereby complete MPEG2 coding. 

In MPEG2, the quantization weights for a particular 8x8 
pixel block are expressed in terms of an 8x8 quantization 
matrix. The invention described herein is concerned with 
extracting these quantization matrices from video data 
which has previously been subjected to a coding operation. 
As described in more detail below, this is possible, particu- 
larly in the case of MPEG2, because coding parameters, 
such as the quantization matrices noted above, can be 

^ determined based on reconstruction levels of DCT coeffi- 
cients in uncoded video data- 
Returning to FIG. 1, coding system 1 includes video 
source 2, encoders 4 and 5, decoders 6 and 7, and various 
transmission media (not shown) between the video source, 

45 encoders, and decoders. Signal processors (not shown) for 
performing image editing, display, etc. may also be included 
at different points in the system, such as between decoder 6 
and encoder 5, For simplicity's sake, however, these signal 
processors are omitted therefrom. 

50 Decoders 6 and 7 can comprise any conventional decoder 
capable of receiving MPEG2-coded video data and of 
decoding the coded video data. Likewise, source 2 can 
comprise any type of apparatus, such as a television camera 
or related video equipment, which is capable of generating 

55 video data based on an original image. Encoders 4 and 5, 
which are identical in both structure and function, include 
the present invention. A functional block diagram of encoder 
4 is shown in FIG. 2, From a hardware standpoint, functions 
of the blocks shown in FIG, 2, and in particular the blocks 

60 comprising the present invention, can be implemented pri- 
marily in software using one or more microprocessors which 
execute process steps stored in a memory. 

As shown in FIG. 2, encoder 4 includes redundant field/ 
frame detection block 9, macro-block color classification 

65 block 10, DCT blocks 11 and 12, adaptive field/frame 
decision block 14, first pass coding block 16, matrix 
calculation/extraction block 17, adaptive quantization con- 
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trol block 19, second pass coding block 20, history block 21, color classification block 10, first pass coding block 16, and 

rate allocation block 22, buffer 24, frame store 26, and redundant field/&ame detection block 9. Second pass coding 

motion estimation (hereinafter "ME") & ME mode selection blodt 20 receives information from adaptive quantization 

block 27. Bold lines in FIG. 2, such as line 29, indicate flow control block 19, and codes the video data based on this 

of video data, whereas narrow lines, such as line 30, indicate 5 information and based on the quantization weights provided 

flow of control data or a final output bit stream. Although the from matrix calculation/extraction block 17. Thereafter, the 

present invention is implemented primarily in matrix coded video data is buffered in buffer 24, and output from 

calculation/extraction block 17, a brief description of the encoder 4 as bit stream 31 

operationo£iheremaimngelementsofencoder4ispiovided nc. 3 is a flow diagram depicting cxxling of the video 

herem for the sake of completeness. j„ ^^^^ ^ accordance with the present invention. Specifically. 

More specifically, in operation, video data is received ^th reference to FIG. 1, in step S301 uncoded video data for 
fiom an external source (e.g.. via a local area network or the in,age u,.^, encoder 4 from source 2. As described 
hke) by redundant field/frame detection block 9. Redundant ^bove with respect to FIG. 2, the uncoded video data is 
field/frame detection btock 9 detects redundant fields in the subjected to DCT prxxxssing in order to generate DCT 
input video data, which may be present due to conversion „ coefficients for AC frequency bands in the video data, 
between mouon picture video and television video, and flags whereafter the video data is transmitted to matrix 
these redundant fields to mdicate that they should not be calculation/extraction blodc 17. There, step S302 determines 
coded Encoder 4 also mcludcs a decision block (not ^^ether the uncoded video data has previously been sub- 
shown), which may comprise part of redundant fieU/frame jeced to a coding operation. In terms of this disclosure, 
detection block 9, which determines whether video data y^^^ ^y^b has not previously been subjected to a 
represents an intra picture (i.e."r), meaning that eadi frame coding operation comprises both video data captured 
thereof is coded independendy, or a predictive picture (i.e., ^^^^^^y j„ ^^^31 j ^as 
"FO, meaning that each from thereof is coded based on other previously been "lighUy coded", meaning that the effects of 
frames using motion estimation or the like. While the such "Ught coding" were so slight as to introduce little or no 
present mvention is intended for use with intra pictures, the ^ quantization noise into the video data, 
invention is not necessarily limited to such use. In this determine whether the video data has previously been 
regard, it is noted that reduction in quantization no^e of ^^^jected to a coding operation, step S302 examines the first 
mtra pictares in accordance with the pr^nt mvention also frequency band in the image represented by the video 
has the eflfecl of improvmg the quality of non-intra pictures, ^^j^ Specifically, step S302 generates a histogram of DCT 
since non-mtra pictures base their motion estmiaUon on 3^ coefficients for the (0,l)th AC frequency band. The histo- 
thesemtrapictur^ gram is generated as follows: 

Returmng to FIG. 2, macro-block color classification 

block 10 classifies macro-blocks of video data based on — , (Eq l) 

texture and composition and transmits relevant information ''o.i = 2j ^' 

to adaptive quantization control block 19, described below. 35 ^^0.0-* 
ME & ME mode selection block 27 performs motion 

estimation for non-intra picture input video data based on where "H" corresponds to the histogram, "C corresponds to 

the input video data and a previous frame of video data a DCT coefficient in the AC frequency band, and takes 

stored in frame store 26. Adaptive field/frame decision block values between 0 and 2048, these values corresponding 

14 determines whether to perform field or frame DCT on the 40 to possible values of the (0,l)th frequency band. Thereafter, 

input video data and selects the appropriate one of DCT the histogram is examined by performing an auto-correlation 

blocks 11 and 12 to perform the appropriate type of DCT on function thereon. SpecificaUy, the auto-correlation function^ 

the video data. First pass coding block 16 gathers informa- / of Hq is determined as follows: 
tion relating to coding the video data and passes this 

information along to adaptive quantization control block 19 45 ^iKoti (Eq 2) 

and to matrix calculation/extraction block 17. ~ 2 ^^^^^ 

As noted above, matrix calculation/extraction block 17 '^o 
substantially implements the present invention. That is, 

matrix calculation/extraction block 17 determines if input , 1 , . ^ , -,n>.r. 1 

, , viLi^iuw> xi. lupui where"d takes on values between 0 and 2048, these values . 

uncoded video data has been previously coded. In a case that 50 j- * n.i t r *i_ i\*t- c_ — 

. , J w Aiiawu>wuuai 5u corresponding to possible values of the (0,l)lh frequency-^ 

Oie video data has been previously coded, for each AC J^^^ ^ ^^^^ ^ highest pljiible value of 

frequency band m each 8x8 pixel block m the video data, ^^^^^ frequency biid. Once Ro ,^is fo^d for all «d's", 

matrix calculation/extraction block 17 generates a ^^ep s302 determines the location of the maximum value of 

histogram, calculates a correlation function for the j> d namely D as follows- 

histogram, identifies a maximum value of the' correlation 55 » > 04' 

function, and determines a quantization weight using the Do,i=aigmax R04 (b»o, (Eq. 3) 

maximum value of the correlation fiinction. The extracted where Dq j corresponds to a distance between reconstruction 

quantization weights are then passed to second pass coding levels of' the (0,l)th AC frequency band. In this regard, 

block 20. A more detailed description of the operation of reconstruction levels represent reconstructed quantized DCT ~ 

matrix calculation/extraction block 17 is set forth in detail 60 coefiBcients for the (0,l)th AC frequency band, 

below. It has been determined experimentally that if Dq^i has a 

Rate allocation block 22 determines a remaining number value which is less than or equal to 4, video data for the 

of available bits in the encoder's bit budget, and passes this image has not previously been subjected to a coding 

information along to adaptive quantization control block 19. operation, i.e., the video data comprises either data captured 

Adaptive quantization control block 19 determines how 65 from an original image or has been "lightly coded". Using 

various sections of the video data should be coded based 00 this information, step S302 examines Dq ^ in order to 

information from rate allocation block 22, macro-block determine whether to proceed to step S303 or step S305. 
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Id a case that step S3Q2 determines that the video data has Values of q„ may vary for different macro-blocks of an 
not previously been subjected to a coding operation, pro- image being coded. However, due to the way in which 
ccssing proceeds to step S303. Step S303 codes the video MPEG2 coding is performed, namely the type of adaptive 
data using a either a pre-stored, default quantization matrix quantization used during coding, q„ generally varies around 
or a calculated quantization matrix. The default quantization 5 a dominant value Q. Locations of these dominant recon- 
matnx may be stored in a memory of the encoder at the time struclion levels (hereinafter "DRLs'O for the (k4)th frc- 
of Its manufacture, or may be obtamed from an external ^and can be determined as follows 

source. The calculated quantization matrix may be calcu- 
lated in accordance with a known matrix calculation rp ^\ 
method, such as that described in co-pending U.S. patent DRiij = i*-^. ^ 
appUcation Ser. No. 08/991,521, entitled "Method Of 
Frame-By-Frame Calculation Of Quantization Matrices", 

filed on Dec. 16. 1997 (Assignee Ref. No.: PHA 23321), the Examples of DRLs are Ubeled 35, 36 and 37 in the histo- 
contents of which are hereby incorporated by reference into gram of FIG. 5. 

the subject appUcation as if set forth herein in full. In any a differentiated dominant reconstruction level 
event, following coding using, e.g., the default matrix, (hereinafter "DDRL") corresponds to the distance between 
processing proceeds to step S304, in which the coded video adjacent DRLs. as shown in FIG. 5. Since distances between 
data is transmitted as a bit stream, adjacent DRLs do not depend on index value "i", the DDRL 

On the other hand, if step S302 determines that the video for DCT coefficients in the (k4)th frequency band is simply 
data has previously been subjected to a coding operation, determined by removing "i" from equation 5, as follows: 
processing proceeds to step S305. Step S305 extracts, from 

the video data, quantization matrices that correspond to w^jQ (Eq.6) 

those used in a previous coding operation. This is possible DDRLkj = 

in MPEG2 because coding parameters, such as the quanti- 
zation matrices noted above, can be determined based on 

reconstruction levels of OCT coefficients of uncoded video ^ quantization weight Wj^^ for DCT coefficients in 

data. Specifically, quantized DCT coefficients in the (k,l)th frequency band can be determined simply by 

uncoded video data correspond to a set of reconstruction solving for W;^.;* ^ follows: 
levels which are determined based on quantization weights 

from a previous coding operation. For each frequency band, ^ 16ddri^ (Eq. 7) 

these reconstruction levels cluster around a series of domi- G 
nant reconstruction levels. Thus, the quantized DCT coef- 
ficients also chister around those dominant reconstruction « ^ ^ ^ . . . 
levels. By analyzing chistering structures of the quantized ^oidex to determine W^, it is necessary to 
DCT coefficients, it is possible to recover the quantization determme DDRL^ and Q. Values of these vanables are 
weights from the previous coding operation, together with determined as foUows 

other quantization parameters, if desired. Tliese recovered "^^^^ ^ ^^^^^ ^"^S ^"^'^^ '^^^^ ^P°° 

quantization weights can then be used to code the video data ^^^^^^^ quantization weights in a quanU- 

with lower additional quantization noise than if the video nation matrix. Accordingly, it is only necessary to toem^^^^ 
data were coded using calculated quantization weights. 'f^^""^ magnitudes of quantization weights W,,,. 

FIG. 4 shows a histogram of DCT coefficients for an AC ^ Tlierefore, an arbitrary value m th^^ case 16. can be set as 
frequency band of an image which has not been coded HG. ^? quantization weight for the (0,l)th AC frequency band 
5 shows a histogram of that same AC frequency band, in W^., . Accordmgly, subsumtmg 16 for Wq.^ m the foUowing 
which the video data has previously been coded. As shown equation 
in FIG. 5, the histogram for the previously-coded video data 

shows a series of ^ikes, examples of which are labeled, 34, ddrLq^i = ^^-^ 

35, and 36. These spikes correspond to reconstruction levels 
for DCT coefficients of the AC frequency band under 
examination. It is the spacing between these spikes that yields 
contains information relating to the quantization weight 

which was previously used to code that AC frequency band. O-DDRi^j. (Eq. 9) 

More specifically, in MPEG 2, locations of reconstruction ^ r^v. c r^r^nr ^ 1.1 

levels (hereinafter "RLs") of an AC frequency band are ^^"^ ^ ^ ^^^^ ^ ^ ^DRL, the problem 

determined based on a quantization weight used in a previ- ^L^'J , "''T' .^ Z^ 

ous coding operation, together with a related quantizer scale k ^ coefficients of oorrespondmg AC fre- 

value. 'nius,locationsofRUfora(k4)lh frequency band of ^T?" ? ^ "l""" u"" I 

an 8x8 pixel block in an (m)th macrcvblock are determined steps which are smular to those described above for 

asfoUows: finding Dq^. 

Referring to FIG, 6, in step S601 process steps are 
executed to generate a histogram for DCT coefficients in the 
Riij = i* 60 (kjl)th AC frequency band in the input video data. This 

16 



histogram is generated as follows: 



where W^^^ comprises a (k,l)lh entry in a quantization matrix _ y j (Eq. 10) 

corresponding to the 8x8 pixel block, q„ comprises the i^^^v 
quantizer scale value for the (m)th macro-block (see above), 65 
and "i", which can take on values of 0, ±1, ±2, ... , 

comprises an index vahie of each reconstruction level. where the variables are as defined above. 
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Next, in step S602, process steps are executed to calculate In preferred embodiments of the invention, if two peaks 

an auto-correlation function for Hjy\ Step S602 may cal- in the histogram are within a predetermined distance of each 

culate the standard auto-correlation fiinction for the (k4)th other, e.g., they are relatively close to each other, an average 

frequency band, which is: value of the locations of the peaks is determined, and the 

5 DDRL is set to this average value. This feature of the 

(Eq. 1 1) invention reduces the likelihood of errors in the DDRL. In 

Yd addition, several other corrections also may be performed in 

order to reduce errors in the DDRL. 
Specifically, in order to correct for the presence of over- 

where the variables are as defined above. ^0 lapping peaks in reconstruction levels of the histogram, the 

In preferred embodiments of the invention, however, step invention examines additional maximum values of the cor- 

S602 executes either the standard auto-correlation function relation function. That is, if peak overlap occurs, the auto- 

or a modified auto-correlation function. Hie modified cor- correlation function of the histogram may give larger values 

relation fiinction performs correlation on less than all peaks than at higher lags. ConsequenUy, the true 

in the histogram. Which function is used depends upon the DDRL may not correspond to the largest histogram value, 

number of peaks in the histogram of the DCT cocfBdents of correspond to the second largest value, the third 

the (kj[)th frequency band. That is, in these embodiments of ^^g^^t value, etc. In these cases, the overlapping peaks are 

the invention, the number of peaks in the histogram is disregarded, and additional values of the correlation func- 

determined, and if the number of peaks is less than or equal examined in order to determine which value corre- 

to an arbitrary pre-set value, the standard auto-correlation sponds to the DDRL. 

function (defined above) is performed for the histogram. In ^^^^^ ^"^^ determining the DDRL due to 

contrast, if the number of peaks is greater than the pre-set ^® harmonics of the base DDRL. This can cause a quanti- 

value, then the modified auto-correlation function is per- ^^^^^^ -weight to be extracted which is a multiple of the real 

formed for the histogram. This feature of the invention is quantization weight. To correct for potential errors of this 

particularly useful in computing systems having limited ^ second peaks of the correlation function are 

capacity, since it reduces the amount of calculations required compared to determine which of the first and second peaks 

by reducing the number of peaks for which the auto- ^ ^^^^^ ^i^h is smaller. It is then determined whether 

correlation function is calculated. ^^^B^^ ^ * harmonic of the smaller peak. In a case 

In a particularly preferred embodiment of the invention, ^'Ser peak is a harmonic of the smaller peak, the 

step S602 generates either the standard auto-correlation smaller peak is set as the maximum value, i.e., the DDRL, 

function or the modified auto-correlation function using and in a case that larger peak is not a harmonic of the smaUer 

values which are different from *V and "d" above. In this P^^k, the larger peak is set as the maximum value, 

embodiment, the histogram is generated in step S601 for ^^^^ distances between different peaks in a 

values of 'Sr^ ranging from 0 to 511. Next, maxj^j is found histogram may not be identical, the invention takes an 

based on the histogram. Variable TP is assigned to the average of these distances in order to find the most accurate 

minimum of 10 and maxjt^,. The maximum histogram y-axis ooeflScients of the (kji)th frequency 

values are then foimd for the variable IV and their corre- band. 

sponding x-axis vahies stored in y/^i^i^^'^ . Thereafter, the DDRLjt^ has been determined, processing proceeds 

auto-correlation fimction in step S602 is computed for V^^ ^ step S604. In step S604 the quantization weight, Wj^^, for 

for i«0 to TP-1, as follows: * ^ (k4)lh AC frequency band is determined by substituting 

(1) if V/'^4[ then DDRLj^^ found in equation 14, and Q (i.e., DDRLq J found 

in equation 9, into equation 7. This is done for all (kj) 

^ (Eq 12) frequency bands in an 8x8 pixel block, thereby yielding the 



^yf* f/j^Dvl* j^^^ quantization matrix therefor FIG. 7 shows a comparison of 

45 quantization weights extracted in accordance with the 

present invention to quantization weights from a default 

matrix used to code original video data. As is dear from 

(2) if V/'>4, then FIG. 7, there is substantial correspondence between the 

extracted quantization weights and the default quantization 

^^/v.w-i (Eq. 13) 50 weights. 

= y. Hf/mJHf'/-\ ni'/^H'^/^'i FoUowing step S604, processing proceeds to step S306 of 

j=o FIG. 3. In step S306, the video data is coded (in second pass 

coding block 20) using a quantization matrix extracted in 

accordance with the invention, whereafter the coded video 
where the variables are as defined above. 55 ^^^^ ^ ^^^^^ ^ ^^^^^^ 53^4 

Retunnng to FIG. 6, once step S602 calculates an auto- FIGS. 8A to 8C show the results of simulations comparing 

correlabon fiinction for the histogram generated in step himfnance (Y) signals and chroma (U.V) signals generated 

S601, the maximum value of the autoncorrelation frmction, ^^^^j ^^^^^ ^ luminance and chroma signals 

R^^, IS identified m step S603. The location of this maxunum generated using matrices extracted in accordance with the 

value of R;^^ IS set as the DDRL for the (k.l)th AC frequency ^^^^^^ invention and to luminance and chroma signals 

band. Specifically. generated using matrices calculated in accordance with 

DDRLtr-argmaxRi/dp.0, (Eq. 14) conventional methods. As is clear from the figures, the 

luminance and chroma signals generated using matrices 

where the variables are as defined above. For example, in extracted in accordance with the present invention are closer 

FIG. 5, peak 35 corresponds to the maximum value Rj^^ of 6S to those generated using default matrices than are the 

the histogram shown therein. Thus, the DDRL comprises the luminance and chroma signals generated using matrices 

distance between "0*' and the location of peak 35. calculated in accordance with conventional methods. RGS. 
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9A to 9C show results of another simulation confirming the 
results of FIGS. 8A to 8C. 

The present invention has been described with respect to 
particular illustrative embodiments. It is to be understood 
that the invention is not limited to the above-described 5 
embodiments and modifications thereto, and that various 
changes and modifications may be made by those of ordi- 
nary skill in the art without departing from the spirit and 
scope of the appended claims. 

What is claimed is: 

1. A method of extracting coding parameters from 
uncoded video data, where the coding parameters corre- 
spond substantially to coding parameters used in a coding 
operation that was previously performed on the video data, 
the method comprising the steps of: 

generating a histogram for at least one AC frequency band 

in the uncoded video data; 
calculating a correlation function for the histogram; 
identifying a maximum value of the correlation function; 

and 

determining a coding parameter for the at least one AC 
frequency band using the maximum value of the cor- 
relation function: 

wherein the step of calculating the coaelation function 
includes: 

identifying a predetermined number of peaks in the ^ 
histogram of the uncoded video data; and 

calculating (i) a standard auto-correlation function for 
the histogram in a case that the number of peaks is 
less than or equal to a predetermined value, where 
the standard auto-correlation function is performed 
for all identified peaks, and (ii) a modified auto- 
correlation function for the histogram in a case that 
the number of peaks is greater than the predeter- 
mined value, where the modified auto-correlation 
function is performed for less than all of the identi- 
fied peaks. 

2. A method according to claim 1, wherein the standard 
auto-correlation function for a (kj[)th AC frequency band 
comprises: 



40 



45 



wherein the modified auto-correlation function for the 
(k,l)th frequency band comprises: 



SO 



where R;^ is a value of the auto correlation function for 
a (kj[)th AC frequency band, Hj^^ corresponds to his- 55 
togram values for the frequency band, maxjt^^ is a 
maximum value of a coefficient for the (kj[)th AC 
frequency band, and V^'' corresponds to a peak value of 
the histogram. 

3. A method according to claim 1, further comprising the 60 
step of performing a discrete cosine transform (DCT) on the 

at least one frequency band prior to the generating step. 

4. A method of extracting coding parameters from 
uncoded video data, where the coding parameters corre- 
spond substantially to coding parameters used in a coding 65 
operation that was previously performed on the video data, 
the method comprising the steps of: 



generating a histogram for at least one AC frequency band 

in the uncoded video data; 
calculating a correlation function for the histogram; 
identifying a maximum value of the correlation function; 

and 

determining a coding parameter for the at least one AC 
frequency band using the maximum value of the cor- 
relation 

wherein the maximum value of the correlation function 
corresponds to a differentiated dominant reconstruction 
level (DDRL) of the at least one AC frequency band 
such that, in the determining step, a coding parameter, 
Wjy, for a (kj) the frequency band is determined by 
solving the following equation: 



frinction: 

where Q defines an adaptive qiiantization value of a block 
of video data which includes the (k4)th frequency band. 

5. A method according to claim 4, wherein the identifying 
step comprises determining whether peaks in the histogram 
of the video data overlap and, in a case that peaks overlap, 
examining additional values of the correlation function in 
order to determine the DDRL. 

6. A method according to claim 4, wherein the identifying 
step comprises: 

comparing first and second peaks of the correlation func- 
tion to determine which of the first and second peaks is 
larger and Miiich is smaller; and 

determining whether the larger peak is a harmonic of the 
smaUer peak; 

wherein, (i) in a case that the larger peak is a harmonic of 
the smaller peak, the smaller peak is set as the maxi- 
mum value, and (ii) in a case that larger peak is not a 
harmonic of the smaller peak, the larger peak is set as 
the maximum value. 

7. A method according to claim 4, wherein DDRL^^^ 
comprises an average of distances between reconstruction 
levels in the histogram of the video data. 

8. A coding apparatus which extracts coding parameters 
from imcoded video data, where the coding parameters 
correspond substantially to coding parameters used in a 
coding operation that was previously performed on the video 
data, the coding apparatus comprising: 

a memory which stores computer-executable process 
steps; and 

a processor which executes the computer-executable pro- 
cess steps stored in the memory so as (i) to generate a 
histogram for at least one AC frequency band in the 
uncoded video data, (ii) to calculate a correlation 
function for the histogram, (iii) to identify a maximum 
value of the correlation function, and (iv) to determine 
a coding parameter for the at least one AC frequency 
band using the maximum value of the correlation 
function; 

wherein the computer-executable process steps to calcu- 
late the correlation function comprise steps to identify 
a predetermined number of peaks in the histogram of 
the uncoded video data, and to calculate (i) a standard 
auto-correlation function for the histogram in a case 
that the number of peaks is less than or equal to a 
predetermined value, where the standard auto- 
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correlation function is performed for all identified 
peaks, and (ii) a modified auto-correlation function for 
the histogram in a case that the number of peaks is 
greater than the predetermined value, where the modi- 
fied auto-correlation function is performed for less than 
all of the identified peaks. 

9. A coding apparatus according to claim S, wherein the 
processor executes computer-executable process steps so as 
to perform a discrete cosine transform (DCT) on the at least 
one fi^quency band prior to generating the histogram. 

10. A coding apparatus according to claim 8, wherein the 
maximum value of the correlation function corresponds to a 
differentiated dominant reconstruction level (DDRL) of the 
at least one frequency band; and 

wherein the processor executes process steps to determine 
the coding parameter based on the DDRL. 

U. Aooding apparatus according to claim 10, wherein the 
DDRL comprises an average of distances between recon- 
sUiiction levels in the histogram of the video data. 

12. A coding apparatus which extracts coding parameters 
from uncodcd video data, where the coding parameters 
correspond substantially to coding parameters used in a 
coding operation that was previously performed on the video 
data, the coding apparatus comprising: 

a memory which stores computer-executable process 
steps; and 

a processor which executes the computer-executable pro- 
cess steps stored in the memory so as (i) to generate a 
histogram for at least one AC fi^uency band in the 
uncoded video data, (u) to calculate a correlation 
function for the histogram, (iii) to identify a maximum 
value of the correlation function, and (iv) to determine 
a coding parameter for the at least one AC &equency 
band using the maximiun value of the correlation 
function; 
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wherein, to identify the maximum value of the correlation 
function, the processor executes process steps to deter- 
mine whether peaks in the histogram of the video data 
overlap and, in a case that peaks overlap, to examine 
additional values of the correlation function in order to 
determine a differentiated dominant reconstruction 
level (DDRL) of the at least one frequency band. 
13. A coding apparatus which extracts coding parameters 
from imcoded video data, where the coding parameters 
^ correspond substantially to coding parameters used in a 
coding operation that was previously performed on the video 
data, the coding apparatus comprising: 
a memory which stores computer-executable process 
steps; and 

, a processor which executes the computer-executable pro- 
' cess steps stored in the memory so as (i) to generate a 
histogram for at least one AC frequency band in the 
uncoded video data, (ii) to calculate a correlation 
function for the histogram, (iii) to identify a maximum 
value of the correlation function, and (iv) to determine 
* a coding parameter for the at least one AC frequency 
band using the maximum value of the correlation 
function; 

wherein, to identify the maximum value of the correlation 
. function, the processor executes process steps to com- 
pare first and second peaks of the correlation function 
to determine which of the first and second peaks is 
larger and which is smaller, and to determine whether 
the larger peak is a harmonic of the smaller peak; and 
J wherein, (i) in a case that the larger peak is a harmonic of 
the smaller peak, the smaller pe^ is set as the maxi- 
mum value, and (ii) in a case that the larger peak is not 
a harmonic of the smaller peak, the larger peak is set as 
the maximum value. 
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